Thickness measurement of substrate using color metrology

ABSTRACT

A metrology system for obtaining a measurement representative of a thickness of a layer on a substrate includes a camera positioned to capture a color image of at least a portion of the substrate. A controller is configured to receive the color image from the camera, store a predetermined path in a coordinate space of at least two dimension including a first color channel and a second color channel, store a function that provides a value representative of a thickness as a function of a position on the predetermined path, determine a coordinate of a pixel in the coordinate space from color data in the color image for the pixel, determine a position of a point on the predetermined path that is closest to the coordinate of the pixel, and calculate a value representative of a thickness from the function and the position of the point on the predetermined path.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application is a divisional of U.S. patent application Ser. No. 16/435,291, filed Jun. 7, 2019, which is a continuation of U.S. patent application Ser. No. 15/686,785, filed Aug. 25, 2017, which claims priority to U.S. Provisional Application Ser. No. 62/379,920, filed Aug. 26, 2016, the disclosures of which are incorporated by reference.

TECHNICAL FIELD

This disclosure relates to optical metrology, e.g., to detect the thickness of a layer on a substrate.

BACKGROUND

An integrated circuit is typically formed on a substrate by the sequential deposition of conductive, semiconductive, or insulative layers on a silicon wafer. One fabrication step involves depositing a filler layer over a non-planar surface and planarizing the filler layer. For certain applications, the filler layer is planarized until the top surface of a patterned layer is exposed. A conductive filler layer, for example, can be deposited on a patterned insulative layer to fill the trenches or holes in the insulative layer. After planarization, the portions of the metallic layer remaining between the raised pattern of the insulative layer form vias, plugs, and lines that provide conductive paths between thin film circuits on the substrate. For other applications, such as oxide polishing, the filler layer is planarized until a predetermined thickness is left over the non planar surface. In addition, planarization of the substrate surface is usually required for photolithography.

Chemical mechanical polishing (CMP) is one accepted method of planarization. This planarization method typically requires that the substrate be mounted on a carrier or polishing head. The exposed surface of the substrate is typically placed against a rotating polishing pad. The carrier head provides a controllable load on the substrate to push it against the polishing pad. An abrasive polishing slurry is typically supplied to the surface of the polishing pad.

Variations in the slurry distribution, the polishing pad condition, the relative speed between the polishing pad and the substrate, and the load on the substrate can cause variations in the material removal rate. These variations, as well as variations in the initial thickness of the substrate layer, cause variations in the time needed to reach the polishing endpoint. Therefore, determining the polishing endpoint merely as a function of polishing time can lead to overpolishing or underpolishing of the substrate.

Various optical metrology systems, e.g., spectrographic or ellipsometric, can be used to measure the thickness of the substrate layer pre-polishing and post-polishing, e.g., at an in-line or stand-along metrology station. In addition, various in-situ monitoring techniques, such as monochromatic optical or eddy current monitoring, can be used to detect a polishing endpoint.

SUMMARY

In one aspect, a metrology system for obtaining a measurement representative of a thickness of a layer on a substrate includes a support to hold a substrate for integrated circuit fabrication, a color camera positioned to capture a color image of at least a portion of the substrate held by the support, and a controller. The controller is configured to receive the color image from the camera, store a predetermined path in a coordinate space of at least two dimension including a first color channel and a second color channel, store a function that provides a value representative of a thickness as a function of a position on the predetermined path, for a pixel of the color image determine a coordinate of the pixel in the coordinate space from color data in the color image for the pixel, determine a position of a point on the predetermined path that is closest to the coordinate of the pixel, and calculate a value representative of a thickness from the function and the position of the point on the predetermined path.

Another aspect is a computer program product including instructions for causing a processor for obtaining a measurement representative of a thickness of a layer on a substrate. Another aspect is a method of obtaining a measurement representative of a thickness of a layer on a substrate.

Implementations of any aspect include one or more of the following features.

The coordinate space may be two-dimensional or three-dimensional.

The first color channel and the selected color channel may selected from the group of color channels comprising hue, saturation, luminosity, X, Y, Z, red chromaticity, green chromaticity, and blue chromaticity. For example, the first color channel may be a red chromaticity and the second color channel may be a green chromaticity.

The controller may be configured to determine the position of the point on the predetermined path by determining a point on the path at which a normal vector to the thickness path passes through the coordinate. The controller may be configured to resolve a degeneracy in the predetermined path by analyzing a cluster of coordinates associated with pixels from a given physical region on the substrate surrounding the pixel. The controller may be configured to resolve the degeneracy in the predetermined path by determining a major axis of the cluster in the coordinate space and select a branch of the predetermine path that is most closely parallel to the major axis.

The function may include an interpolation between a first value for a first vertex of the predetermined path and a second value for a second vertex of the predetermined path based on a distance of the point from the first vertex. The first vertex is a starting point of the predetermined path and the second vertex is an end point of the predetermined path. The controller may be configured to store the predetermined path by storing one or more of a Bezier function or a polyline.

The controller may be configured to, for each of a plurality of pixels of the color image, determine a coordinate of the pixel in the coordinate space from color data in the color image for the pixel, determine the position of the point on the predetermined path that is closest to the coordinate of the pixel, and calculate the value representative of the thickness from the function and the position of the point on the predetermined path, to provide a plurality of values representative of thickness for a plurality of different positions on the substrate. The controller may be configured to count a number of pixels from the plurality of pixels in which the value representative of the thickness fails to meet a threshold. The controller may be configured to apply an image mask to the color image.

The color camera may be configured to capture an image of less than all of the substrate and is configured to scan across the substrate, and wherein the controller is configured to generate a 2-dimensional color image of all of the substrate from multiple images from the color camera. The color camera may be a color line-scan camera having detector elements arranged along a first axis that is parallel to the surface of the substrate during scanning of the substrate, and the system may include an elongated white light source having a longitudinal axis parallel to the first axis and configured to direct light toward the substrate at a non-zero incidence angle during scanning of the substrate, a frame supporting the light source and the camera, and a motor to cause relative motion between the frame and the support along a second axis perpendicular to the first axis to cause the light source and the camera to scan across the substrate.

The support may be positioned in an in-line metrology station of a semiconductor fabrication tool. The semiconductor fabrication tool may be a chemical mechanical polisher, and the tool may include a robot to transfer the substrate into the in-line metrology station, and the controller may be configured to cause the substrate to be measured by before or after polishing of a surface of the substrate by the chemical mechanical polisher.

Implementations can include one or more of the following potential advantages. Thickness of a location on a substrate layer can be determined, either absolutely or relative to other portions of the substrate layer, and unacceptable variations can be detected. This information can be used in a feed-forward or feed-back use to control polishing parameters, providing improved thickness uniformity. The algorithm to determine variations can be simple and have low computational load.

The details of one or more implementations are set forth in the accompanying drawings and the description below. Other aspects, features and advantages will be apparent from the description and drawings, and from the claims.

DESCRIPTION OF DRAWINGS

FIG. 1 illustrates a schematic view of an example of an in-line optical measurement system.

FIG. 2 is a flow chart for a method of determining layer thickness.

FIG. 3 is a schematic top view of a substrate.

FIG. 4 is a schematic view of a mask.

FIG. 5 illustrates an example graph showing evolution of color of light reflected from a substrate in a coordinate space of two color channels.

FIG. 6 illustrates an example graph showing a predetermined path in the coordinate space of two color channels.

FIG. 7 is a flow chart for a method of determining layer thickness from color image data.

FIG. 8 illustrates an example graph showing a histogram in the coordinate space of two color channels derived from a color image of a test substrate.

Like reference symbols in the various drawings indicate like elements.

DETAILED DESCRIPTION

The thickness of a layer on a substrate can be optically measured before or after polishing, e.g., at an in-line or stand-alone metrology station. However, some optical techniques such as spectrometry require expensive spectrographs and computationally heavy manipulation of spectra data. Even apart from computational load, in some situations the algorithm results do not meet the ever increasing accuracy requirements of the user. However, another metrology technique is to take a color image of the substrate, and analyze the image in a color space to determine the thickness of the layer. In particular, the position along a path in a 2-dimensional color space can provide information on the current state of polishing, e.g., amount removed or amount of material remaining.

Referring to FIG. 1, a polishing apparatus 100 includes an in-line (also referred to as in-sequence) optical metrology system 160, e.g., a color imaging system.

The polishing apparatus 100 includes one or more carrier heads 126, each of which is configured to carry a substrate 10, one or more polishing stations 106, and a transfer station to load substrate to and unload substrates from a carrier head. Each polishing station 106 includes a polishing pad 130 supported on a platen 120. The polishing pad 110 can be a two-layer polishing pad with an outer polishing layer and a softer backing layer.

The carrier heads 126 can be suspended from a support 128, and movable between the polishing stations. In some implementations, the support 128 is an overhead track and the carrier heads 126 are coupled to a carriage 108 that is mounted to the track. The overhead track 128 allows each carriage 108 to be selectively positioned over the polishing stations 124 and the transfer station. Alternatively, in some implementations the support 128 is a rotatable carousel, and rotation of the carousel moves the carrier heads 126 simultaneously along a circular path.

Each polishing station 106 of the polishing apparatus 100 can include a port, e.g., at the end of an arm 134, to dispense polishing liquid 136, such as abrasive slurry, onto the polishing pad 130. Each polishing station 106 of the polishing apparatus 100 can also include pad conditioning apparatus to abrade the polishing pad 130 to maintain the polishing pad 130 in a consistent abrasive state.

Each carrier head 126 is operable to hold a substrate 10 against the polishing pad 130. Each carrier head 126 can have independent control of the polishing parameters, for example pressure, associated with each respective substrate. In particular, each carrier head 126 can include a retaining ring 142 to retain the substrate 10 below a flexible membrane 144. Each carrier head 126 also includes a plurality of independently controllable pressurizable chambers defined by the membrane, e.g., three chambers 146 a-146 c, which can apply independently controllable pressurizes to associated zones on the flexible membrane 144 and thus on the substrate 10. Although only three chambers are illustrated in FIG. 1 for ease of illustration, there could be one or two chambers, or four or more chambers, e.g., five chambers.

Each carrier head 126 is suspended from the support 128, and is connected by a drive shaft 154 to a carrier head rotation motor 156 so that the carrier head can rotate about an axis 127. Optionally each carrier head 140 can oscillate laterally, e.g., by driving the carriage 108 on the track 128, or by rotational oscillation of the carousel itself. In operation, the platen is rotated about its central axis 121, and each carrier head is rotated about its central axis 127 and translated laterally across the top surface of the polishing pad. The lateral sweep is in a direction parallel to the polishing surface 212. The lateral sweep can be a linear or arcuate motion.

A controller 190, such as a programmable computer, is connected to each motor to independently control the rotation rate of the platen 120 and the carrier heads 126. For example, each motor can include an encoder that measures the angular position or rotation rate of the associated drive shaft. Similarly, the controller 190 is connected to an actuator in each carriage 108 and/or the rotational motor for the carousel to independently control the lateral motion of each carrier head 126. For example, each actuator can include a linear encoder that measures the position of the carriage 108 along the track 128.

The controller 190 can include a central processing unit (CPU), a memory, and support circuits, e.g., input/output circuitry, power supplies, clock circuits, cache, and the like. The memory is connected to the CPU. The memory is a non-transitory computable readable medium, and can be one or more readily available memory such as random access memory (RAM), read only memory (ROM), floppy disk, hard disk, or other form of digital storage. In addition, although illustrated as a single computer, the controller 190 could be a distributed system, e.g., including multiple independently operating processors and memories.

The in-line optical metrology system 160 is positioned within the polishing apparatus 100, but does not perform measurements during the polishing operation; rather measurements are collected between polishing operations, e.g., while the substrate is being moved from one polishing station to another or from or to the transfer station.

The in-line optical metrology system 160 includes a sensor assembly 161 supported at a position between two of the polishing stations 106, e.g., between two platens 120. In particular, the sensor assembly 161 is located at a position such that a carrier head 126 supported by the support 128 can position the substrate 10 over the sensor assembly 161.

In implementations in which the polishing apparatus 100 include three polishing stations and carries the substrates sequentially from the first polishing station to the second polishing station to the third polishing station, one or more sensor assemblies 161 can be positioned between the transfer station and the first polishing station, between first and second polishing stations, between the second and third polishing stations, and/or between the third polishing station and the transfer station.

The sensor assembly 161 can include a light source 162, a light detector 164, and circuitry 166 for sending and receiving signals between the controller 190 and the light source 162 and light detector 164.

The light source 162 can be operable to emit white light. In one implementation, the white light emitted includes light having wavelengths of 200-800 nanometers. A suitable light source is an array of white-light light emitting diodes (LEDs), or a xenon lamp or a xenon mercury lamp. The light source 162 is oriented to direct light 168 onto the exposed surface of the substrate 10 at a non-zero angle of incidence α. The angle of incidence a can be, for example, about 30° to 75°, e.g., 50°.

The light source can illuminate a substantially linear elongated region that spans the width of the substrate 10. For the light source can 162 can include optics, e.g., a beam expander, to spread the light from the light source into an elongated region. Alternatively or in addition, the light source 162 can include a linear array of light sources. The light source 162 itself, and the region illuminated on the substrate, can elongated and have a longitudinal axis parallel to the surface of the substrate.

A diffuser 170 can be placed in the path of the light 168, or the light source 162 can include a diffuser, to diffuse the light before it reaches the substrate 10.

The detector 164 is a color camera that is sensitive to light from the light source 162. The camera includes an array of detector elements. For example, the camera can include a CCD array. In some implementations, the array is a single row of detector elements. For example, the camera can be a linescan camera. The row of detector elements can extend parallel to the longitudinal axis of the elongated region illuminated by the light source 162. Where the light source 162 includes a row of light emitting elements, the row of detector elements can extend along a first axis parallel to the longitudinal axis of the light source 162. A row of detector elements can include 1024 or more elements.

The camera 164 is configured with appropriate focusing optics 172 to project a field of view of the substrate onto the array of detector elements 178. The field of view can be long enough to view the entire width of the substrate 10, e.g., 150 to 300 mm long. The camera 164, including associated optics 172, can be configured such that individual pixels correspond to a region having a length equal to or less than about 0.5 mm. For example, assuming that the field of view is about 200 mm long and the detector 164 includes 1024 elements, then an image generated by the linescan camera can have pixels with a length of about 0.5 mm. To determine the length resolution of the image, the length of the field of view (FOV) can be divided by the number of pixels onto which the FOV is imaged to arrive at a length resolution.

The camera 164 can be also be configured such that the pixel width is comparable to the pixel length. For example, an advantage of a linescan camera is its very fast frame rate. The frame rate can be at least 5 kHz. The frame rate can be set at a frequency such that as the imaged area scans across the substrate 10, the pixel width is comparable to the pixel length, e.g., equal to or less than about 0.3 mm.

The light source 162 and the light detector 164 can be supported on a stage 180. Where the light detector 164 is a line-scan camera, the light source 162 and camera 164 are movable relative to the substrate 10 such that the imaged area can scan across the length of the substrate. In particular, the relative motion can be in a direction parallel to the surface of the substrate 10 and perpendicular to the row of detector elements of the linescan camera 164.

In some implementations, the stage 182 is stationary, and the carrier head 126 moves, e.g., either by motion of the carriage 108 or by rotational oscillation of the carousel. In some implementations, the stage 180 is movable while the carrier head 126 remains stationary for the image acquisition. For example, the stage 180 can be movable along a rail 184 by a linear actuator 182. In either case, this permits the light source 162 and camera 164 to stay in a fixed position relative to each other as the area being scanned moves across the substrate 10.

A possible advantage of having a line-scan camera and light source that move together across the substrate is that, e.g., as compared to a conventional 2D camera, the relative angle between the light source and the camera remains constant for different positions across the wafer. Consequently, artifacts caused by variation in the viewing angle can be reduced or eliminated. In addition, a line scan camera can eliminate perspective distortion, whereas a conventional 2D camera exhibits inherent perspective distortion, which then needs to be corrected by an image transformation.

The sensor assembly 161 can include a mechanism to adjust vertical distance between the substrate 10 and the light source 162 and detector 164. For example, the sensor assembly 161 can an actuator to adjust the vertical position of the stage 180.

Optionally a polarizing filter 174 can be positioned in the path of the light, e.g., between the substrate 10 and the detector 164. The polarizing filter 184 can be a circular polarizer (CPL). A typical CPL is a combination of a linear polarizer and quarter wave plate. Proper orientation of the polarizing axis of the polarizing filter 184 can reduce haze in the image and sharpen or enhance desirable visual features.

Assuming that the outermost layer on the substrate is a semitransparent layer, e.g., a dielectric layer, the color of light detected at detector 164 depends on, e.g., the composition of the substrate surface, substrate surface smoothness, and/or the amount of interference between light reflected from different interfaces of one or more layers (e.g., dielectric layers) on the substrate.

As noted above, the light source 162 and light detector 164 can be connected to a computing device, e.g., the controller 190, operable to control their operation and receive their signals.

Referring to FIG. 2, the controller assembles the individual image lines from the light detector 164 into a two-dimensional color image (step 200). The camera 164 can include separate detector elements for each of red, blue and green. The two-dimensional color image can include a monochromatic image 204, 206, 208 for each of the red, blue and green color channels.

The controller can apply an offset and/or a gain adjustment to the intensity values of the image in each color channel (step 210). Each color channel can have a different offset and/or gain.

Optionally, the image can be normalized (step 220). For example, the difference between the measured image and a standard predefined image can be calculated. For example, the controller can store a background image for each of the red, green and blue color channels, and the background image can be subtracted from the measured image for each color channel. Alternatively, the measured image can be divided by the standard predefined image.

The image can be filtered to remove low-frequency spatial variations (step 230). In some implementations, the image is transformed from a red green blue (RGB) color space to a hue saturation luminance (HSL) color space, the filter is applied in the HSL color space, and then the image is transformed back red green blue (RGB) color space. For example, in the HSL color space, the luminance channel can filtered to remove low-frequency spatial variations, i.e., hue and saturation channels are not filtered. In some implementations, the luminance channel is used to generate the filter, which is then applied to the red, green and blue images.

In some implementations, the smoothing is performed only along the first axis. For example, the luminance values of the pixels along the direction of travel 186 can be averaged together to provide an average luminance value that is a function of just the position along the first axis. Then each row of image pixels can be divided by the corresponding portion of the average luminance value that is a function of the position along the first axis.

The controller can use image processing techniques to analyze the image to locate a wafer orientation feature 16, e.g., a wafer notch or wafer flat, on the substrate 10 (see FIG. 4) (step 240). Image processing techniques can also be used to locate a center 18 of the substrate 10 (see FIG. 4).

Based on this data, the image is transformed, e.g., scaled and/or rotated and/or translated, into a standard image coordinate frame (step 250). For example, the image can be translated so that the wafer center is at the center point of the image and/or the image can be scaled so that the edge of the substrate is at the edge of the image, and/or the image can be rotated so that there is a Wangle between the x-axis of the image and the radial segment connecting the wafer center and the wafer orientation feature.

Optionally, an image mask can applied to screen out portions of the image data (step 260). For example, referring to FIG. 3, a typical substrate 10 includes multiple dies 12. Scribe lines 14 can separate the dies 12. For some applications, it may be useful to process only image data corresponding to the dies. In this case, referring to FIG. 4, an image mask can be stored by the controller, with unmasked region 22 corresponding in spatial position to the dies 12 and masked region 24 corresponding to the scribe lines 14. Image data corresponding to the masked region 24 is not processed or not used during the thresholding step. Alternatively, the masked region 24 could correspond to the dies so that the unmasked region corresponds to the scribe lines, or the unmasked region could be only a portion of each die with a remainder of each die being masked, or the unmasked region could be specific die or dies with remaining dies and scribe lines being masked, the unmasked region could be just a portion of specific die or dies, with a remainder of each die the substrate being masked. In some implementations, the user can define the mask using a graphical user interface on the controller 190.

The color data at this stage can be used to calculate a value representative of thickness (step 270). This value could be a thickness, or an amount of material removed, or a value indicating the amount of progress through the polishing process (e.g., as compared to a reference polishing process). The calculation can be performed for each unmasked pixel in the image. This value can then be used in a feed-forward or feed-back algorithm use to control polishing parameters, providing improved thickness uniformity. For example, the value for each pixel can be compared to a target value to generate an error signal image, and this error signal image can be used for feed-forward or feed-back control.

Some background to assist in understanding of the calculation of the value representative will be discussed. For any given pixel from the color image, a pair of values corresponding to two color channels can be extracted from the color data for the given pixel. Thus, each pair of values can define a coordinate in a coordinate space of a first color channel and a different second color channel. Possible color channels include hue, saturation, luminosity, X, Y, Z (e.g., from the CIE 1931 XYZ color space), red chromaticity, green chromaticity, and blue chromaticity.

Referring to FIG. 5, for example, when polishing begins, the pair of values (e.g., V1 ₀, V2 ₀) defines an initial coordinate 502 in the coordinate space 500 of the two color channels. However, because the spectrum of reflected light changes as polishing progresses, the color composition of the light changes, and the values (V1, V2) in the two color channels will change. Consequently the location of the coordinate within the coordinate space of the two color channels will change as polishing progresses, tracing out a path 504 in the coordinate space 500.

Referring to FIGS. 6 and 7, to calculate the value representative of thickness, a predetermined path 604 in the coordinate space 500 of the two color channels is stored (step 710), e.g., in a memory of the controller 190. The predetermined path is generated prior to measurement of the substrate. The path 404 can proceed from a starting coordinate 402 to an end coordinate 406. The path 404 can represent an entire polishing process, with the starting coordinate 402 corresponding to a starting thickness for the layer on the substrate and the end coordinate corresponding to a final thickness for the layer. Alternatively, the path can represent just a portion of a polishing process, e.g., the expected distribution of layer thicknesses across a substrate at the polishing endpoint.

In some implementations, to generate the predetermined path 404, a set-up substrate is polished to approximately the target thickness that will be used for device substrates. A color image of the set-up substrate is obtained using the optical metrology system 160. Since the polishing rate across the substrate is typically not uniform, different locations on the substrate will have different thicknesses, and thus reflect different colors, and thus have different coordinates with in the coordinate space of the first color channel and the second color channel.

Referring to FIG. 8, a two-dimensional (2D) histogram is computed using pixels contained within the unmasked regions. That is, using the color image, a scatter plot 800 is generated in the coordinate space of the first color channel and the second color channel using the coordinate values for some or all of the pixels from the unmasked portions of the set-up substrate. Each point 802 in the scatter plot is the pair of values (V1, V2) for the two color channels for a particular pixel. The scatter plot 800 can be displayed on a display of the controller 190 or another computer.

As noted above, possible color channels include hue, saturation, luminosity, X, Y, Z (e.g., from the CIE 1931 XYZ color space), red chromaticity, green chromaticity, and blue chromaticity. In some implementations, the first color channel is a red chromaticity (r) and the second color channel is a green chromaticity (g) which can be defined by

$r = {{\frac{R}{R + G + B}\mspace{14mu}{and}\mspace{14mu} g} = \frac{R}{R + G + B}}$

respectively, where R, G and B are the intensity values for the red, green and blue color channels of the color image.

The thickness path 604 can be created manually by a user, e.g., the operator of the semiconductor manufacturing facility, using a graphical user interface in conjunction with a computer, e.g., the controller 190. For example, while the scatter plot is being displayed, the user can manually construct a path that follows and overlies the scatter plot, e.g., using mouse operations to click on selected points of the display in the scatter plot.

Alternatively, the thickness path 604 can be generated automatically using software that is designed to analyze the set of coordinates in the scatter plot and generate a path that fits the points in the scatter plot 800, e.g., using topological skeletonization.

The thickness path 604 can be provided by a variety of functions, e.g., using a single line, a polyline, one or more circular arcs, one or more Bezier curves, and the like. In some implementations, the thickness path 604 is provided by a polyline, which is a set of line segments drawn between discrete points in the coordinate space.

Returning to FIG. 6, a function provides a relationship between positions on the predetermined thickness path 604 and thickness values. For example, the controller 190 can store a first thickness value for a starting point 602 of the predetermined thickness path 604, and a second thickness value for the ending point 606 of the predetermined thickness path 604. The first and second thickness values can be obtained by using a conventional thickness metrology system to measure the thickness of the substrate layer at the locations corresponding to the pixels that provide the points 802 closest to the starting point 602 and ending point 606, respectively.

In operation, the controller 190 can calculate a value representing the thickness at a given point 610 on the path 604 by interpolating between the first and second values based on the distance along the path 604 from the starting point 602 to the given point 610. For example, if the controller could calculate a thickness T of a given point 610 according to

$T = {{T1} + {\frac{D}{L}\left( {{T2} - {T1}} \right)}}$

where T1 is the value for the starting point 602, T2 is the thickness for the ending point 606, L is the total distance along the path between the starting point 602 and the ending point 606, and D is the distance along the path between the starting point 602 and the given point 610.

As another example, the controller 190 can store a thickness value for each vertex of on the predetermined thickness path 604, and calculate a value representing thickness for a given point on the path based on interpolation between the two nearest vertices. For this configuration, the various values for the vertexes can be obtained by using a conventional thickness metrology system to measure the thickness of the substrate layer at the locations corresponding to the pixels that provide the points 802 closest to the vertexes.

Other functions relating the position on the path to thickness are possible.

In addition, rather measure the thickness of the set-up substrate using a metrology system, the thickness values could be obtained calculated based on an optical model.

The thickness values can be actual thickness value if one uses theoretical simulation or empirical learning based on a known “setup” wafer. Alternatively, the thickness value at a given point on the predetermined thickness path can be a relative value, e.g., relative to a degree of polishing of the substrate. This later value can be scaled in a downstream process to obtain an empirical value or can be used simply to express increases or decreases in thickness without specifying absolute thickness values.

Referring to FIGS. 6 and 7, for the pixel being analyzed from the image of the substrate, the values for the two color channels are extracted from the color data for that pixel (step 720). This provides a coordinate 620 in the coordinate system 600 of the two color channels.

Next, the point, e.g., point 610, on the predetermined thickness path 604 that is closest to the coordinate 620 for the pixel is calculated (step 730). In this context, “closest” does not necessarily indicate geometrical perfection. The “closest” point could be defined in various ways, and limitations of processing power, selection of search function for ease of calculation, presence of multiple local maxima in the search function, and such can prevent determination of a geometrical ideal, but still provide results that are good enough for use. In some implementations, the closest point is defined as the point on the thickness path 604 which defines the normal vector to the thickness path that passes through the coordinate 620 for the pixel. In some implementations, the closest point is calculated by minimizing a Euclidean distance.

Then the value representing thickness is calculated from the function based on the position of the point 610 on the path 604, as discussed above (step 740). The closest point is not necessarily one of the vertexes points of the polyline. As noted above, in this case, interpolation can be used to obtain the thickness value (e.g. based on simple linear interpolation between the nearest vertexes of the polyline).

By iterating steps 720-740 for some or all of the pixels in the color image, a map of the thickness of the substrate layer can be generated. For some layer stacks on the substrate, the predetermined thickness path will cross itself, which leads to a situation referred to as a degeneracy. A degenerate point (e.g., point 650) on the predetermined thickness path has two or more thickness values associated with it. Consequently, without some additional information it may not be possible to know which thickness value is the correct value. However, it is possible to analyze the properties of a cluster of coordinates associated with pixels from a given physical region on the substrate, e.g., within a given die, and resolve the degeneracy using this additional information. For example, measurements within a given small region of the substrate can be assumed to not vary significantly, and thus would occupy a smaller section along of the scatter plot, i.e., would not extend along both branches.

As such, the controller can analyze a cluster of coordinates associated with pixels from a given physical region on the substrate surrounding the pixel for which the degeneracy needs to be resolved. In particular, the controller can determine a major axis of the cluster in the coordinate space. The branch of the predetermined thickness path that is most closely parallel to the major axis of the cluster can be selected and used to calculate the value representing thickness.

Returning to FIG. 2, optionally a uniformity analysis can be performed for each region of the substrate, e.g., each die, or for the entire image (step 280). For example, the value for each pixel can be compared to a target value, and the total number of “failing” pixels within a die, i.e., pixels that do not meet the target value, can be calculated for the die. This total can be compared to a threshold to determine whether the die is acceptable, e.g., if the total is less than the threshold then the die is marked as acceptable. This gives a pass/fail indication for each die.

As another example, the total number of “failing” pixels within the unmasked region of the substrate can be calculated. This total can be compared to a threshold to determine whether the substrate is acceptable, e.g., if the total is less than the threshold then the substrate is marked as acceptable. The threshold can be set by a user. This gives a pass/fail indication for the substrate.

Where a die or a wafer is determined to “fail”, the controller 190 can generate an alert or cause the polishing system 100 to take corrective action. For example, an audible or visual alert can be generated, or a data file can be generated indicating that a particular die is not usable. As another example, a substrate could be sent back for rework.

In contrast to spectrographic processing, in which a pixel typically is represented by 1024 or more intensity values, in a color image a pixel can be represented by just three intensity values (red, green and blue), and just two color channels are needed for the calculation. Consequently, the computational load to process the color image is significantly lower.

In general, data can be used to control one or more operation parameters of the CMP apparatus. Operational parameters include, for example, platen rotational velocity, substrate rotational velocity, the polishing path of the substrate, the substrate speed across the plate, the pressure exerted on the substrate, slurry composition, slurry flow rate, and temperature at the substrate surface. Operational parameters can be controlled real-time, and can be automatically adjusted without the need for further human intervention.

As used in the instant specification, the term substrate can include, for example, a product substrate (e.g., which includes multiple memory or processor dies), a test substrate, a bare substrate, and a gating substrate. The substrate can be at various stages of integrated circuit fabrication, e.g., the substrate can be a bare wafer, or it can include one or more deposited and/or patterned layers. The term substrate can include circular disks and rectangular sheets.

However, the color image processing technique described above can be particularly useful in the context of 3D vertical NAND (VNAND) flash memory. In particular, the layer stack used in fabrication of VNAND is so complicated that current metrology methods (e.g., Nova spectrum analysis) may be unable to perform with sufficiently reliability in detecting regions of improper thickness. In contrast, the color image processing technique can have superior throughput.

Embodiments of the invention and all of the functional operations described in this specification can be implemented in digital electronic circuitry, or in computer software, firmware, or hardware, including the structural means disclosed in this specification and structural equivalents thereof, or in combinations of them. Embodiments of the invention can be implemented as one or more computer program products, i.e., one or more computer programs tangibly embodied in a non-transitory machine readable storage media, for execution by, or to control the operation of, data processing apparatus, e.g., a programmable processor, a computer, or multiple processors or computers.

Terms of relative positioning are used to denote positioning of components of the system relative to each other, not necessarily with respect to gravity; it should be understood that the polishing surface and substrate can be held in a vertical orientation or some other orientations.

A number of implementations have been described. Nevertheless, it will be understood that various modifications may be made. For example

-   -   Rather than a line scan camera, a camera that images the entire         substrate could be used. In this case, motion of the camera         relative to the substrate is not needed.     -   The camera could cover less than the entire width of the         substrate. In this case, the camera would need to undergo motion         in two perpendicular directions, e.g., be supported on an X-Y         stage, in order to scan the entire substrate.     -   The light source could illuminate the entire substrate. In this         case, the light source need not move relative to the substrate.     -   The light detector can be a spectrometer rather than a color         camera; the spectra data can then be reduced to the appropriate         color channels.     -   Although coordinates represented by pairs of values in a         2-dimensional coordinate space are discussed above, the         technique is applicable to coordinate spaces with three or more         dimensions defined by three or more color channels.     -   The sensory assembly need not an in-line system positioned         between polishing stations or between a polishing station and a         transfer station. For example, the sensor assembly could be         positioned within the transfer station, positioned in a cassette         interface unit, or be a stand-alone system.     -   The uniformity analysis step is optional. For example, the image         generated by applying the threshold transformation can be fed         into a feed-forward process to adjust a later processing step         for the substrate, or into a feed-back process to adjust the         processing step for a subsequent substrate.     -   Although an in-line system is described, the techniques could be         applied to in-situ measurements, i.e., measurements while the         substrate is being polished. In this case, various components of         the monitoring system could be installed in a recess in the         platen, and rotation of the platen can cause the components to         scan across the substrate. For in-situ measurements, rather than         construct an image, the monitoring system could simply detect         the color of a white light beam reflected from a spot on the         substrate, and use this color data to determine the thickness         using the techniques described above.     -   Although the description has focused on polishing, the         techniques can be applied to other sorts of semiconductor         fabrication processes that add or remove layers and that can be         optically monitored, such as etching and deposition.

Accordingly, other implementations are within the scope of the claims. 

What is claimed is:
 1. A computer program product for obtaining values representative of a plurality of thicknesses of a plurality of locations on a substrate, the computer program product tangibly embodied in a non-transitory computer readable medium, comprising instructions for causing a processor to: store an image mask representing portions of the image to screen out; receive a color image of the substrate from a color camera; transform the color image to a standard coordinate frame; apply the image mask to the color image in the standard coordinate frame to generate a masked color image; for each unmasked pixel in the masked color image, determine a coordinate of the pixel in a coordinate space of at least two dimensions including a first color channel and a second color channel from color data in the color image for the pixel, and determine a value representative of a thickness of a layer on the substrate based on the coordinate of the pixel in the coordinate space.
 2. The computer program product of claim 1, wherein the image mask comprises a plurality of rectangular unmasked regions separated by gaps that provide the portions of the image to screen out.
 3. The computer program product of claim 1, wherein the portions of the image to mask out represent scribe lines on a substrate.
 4. The computer program product of claim 1, comprising instructions to locate a wafer orientation feature and a wafer center.
 5. The computer program product of claim 4, comprising instructions to rotate the color image such that that there is a Wangle between the x-axis of the image and a radial segment connecting the wafer center and the wafer orientation feature.
 6. The computer program product of claim 1, wherein the first color channel and the selected color channel are selected from the group of color channels comprising hue, saturation, luminosity, X, Y, Z, red chromaticity, green chromaticity, and blue chromaticity.
 7. The computer program product of claim 6, wherein the first color channel is a red chromaticity and the second color channel is a green chromaticity.
 8. The computer program product of claim 1, comprising instructions to store a function that provides a value representative of a thickness as a function of a position on a predetermined path in the coordinate space; for each pixel of a plurality of pixels at different locations in the color image, determine a position of a point on the predetermined path that is closest to the coordinate of the pixel; and calculate the value representative of a thickness of a layer on the substrate from the function and the position of the point on the predetermined path.
 9. The computer program product of claim 8, wherein the coordinate space is two-dimensional.
 10. The computer program product of claim 8, wherein the coordinate space is three-dimensional.
 11. The computer program product of claim 1, comprising instructions to receive a sequence of color images with each image capturing less than all of the substrate and to generate a 2-dimensional color image of all of the substrate from the sequence of images. 